41 research outputs found

    Genome-wide estimation of gender differences in the gene expression of human livers: Statistical design and analysis

    Get PDF
    BACKGROUND: Gender differences in gene expression were estimated in liver samples from 9 males and 9 females. The study tested 31,110 genes for a gender difference using a design that adjusted for sources of variation associated with cDNA arrays, normalization, hybridizations and processing conditions. RESULTS: The genes were split into 2,800 that were clearly expressed (expressed genes) and 28,310 that had expression levels in the background range (not expressed genes). The distribution of p-values from the 'not expressed' group was consistent with no gender differences. The distribution of p-values from the 'expressed' group suggested that 8 % of these genes differed by gender, but the estimated fold-changes (expression in males / expression in females) were small. The largest observed fold-change was 1.55. The 95 % confidence bounds on the estimated fold-changes were less than 1.4 fold for 79.3 %, and few (1.1%) exceed 2-fold. CONCLUSION: Observed gender differences in gene expression were small. When selecting genes with gender differences based upon their p-values, false discovery rates exceed 80 % for any set of genes, essentially making it impossible to identify any specific genes with a gender difference

    Analyzing 2D gel images using a two-component empirical bayes model

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Two-dimensional polyacrylomide gel electrophoresis (2D gel, 2D PAGE, 2-DE) is a powerful tool for analyzing the proteome of a organism. Differential analysis of 2D gel images aims at finding proteins that change under different conditions, which leads to large-scale hypothesis testing as in microarray data analysis. Two-component empirical Bayes (EB) models have been widely discussed for large-scale hypothesis testing and applied in the context of genomic data. They have not been implemented for the differential analysis of 2D gel data. In the literature, the mixture and null densities of the test statistics are estimated separately. The estimation of the mixture density does not take into account assumptions about the null density. Thus, there is no guarantee that the estimated null component will be no greater than the mixture density as it should be.</p> <p>Results</p> <p>We present an implementation of a two-component EB model for the analysis of 2D gel images. In contrast to the published estimation method, we propose to estimate the mixture and null densities simultaneously using a constrained estimation approach, which relies on an iteratively re-weighted least-squares algorithm. The assumption about the null density is naturally taken into account in the estimation of the mixture density. This strategy is illustrated using a set of 2D gel images from a factorial experiment. The proposed approach is validated using a set of simulated gels.</p> <p>Conclusions</p> <p>The two-component EB model is a very useful for large-scale hypothesis testing. In proteomic analysis, the theoretical null density is often not appropriate. We demonstrate how to implement a two-component EB model for analyzing a set of 2D gel images. We show that it is necessary to estimate the mixture density and empirical null component simultaneously. The proposed constrained estimation method always yields valid estimates and more stable results. The proposed estimation approach proposed can be applied to other contexts where large-scale hypothesis testing occurs.</p

    Sample size calculation for microarray experiments with blocked one-way design

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>One of the main objectives of microarray analysis is to identify differentially expressed genes for different types of cells or treatments. Many statistical methods have been proposed to assess the treatment effects in microarray experiments.</p> <p>Results</p> <p>In this paper, we consider discovery of the genes that are differentially expressed among <it>K </it>(> 2) treatments when each set of <it>K </it>arrays consists of a block. In this case, the array data among <it>K </it>treatments tend to be correlated because of block effect. We propose to use the blocked one-way ANOVA <it>F</it>-statistic to test if each gene is differentially expressed among <it>K </it>treatments. The marginal p-values are calculated using a permutation method accounting for the block effect, adjusting for the multiplicity of the testing procedure by controlling the false discovery rate (FDR). We propose a sample size calculation method for microarray experiments with a blocked one-way design. With FDR level and effect sizes of genes specified, our formula provides a sample size for a given number of true discoveries.</p> <p>Conclusion</p> <p>The calculated sample size is shown via simulations to provide an accurate number of true discoveries while controlling the FDR at the desired level.</p

    Parallel multiplicity and error discovery rate (EDR) in microarray experiments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array.</p> <p>Results</p> <p>This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small.</p> <p>Conclusions</p> <p>Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.</p

    Transcriptional Homeostasis of a Mangrove Species, Ceriops tagal, in Saline Environments, as Revealed by Microarray Analysis

    Get PDF
    <div><h3>Background</h3><p>Differential responses to the environmental stresses at the level of transcription play a critical role in adaptation. Mangrove species compose a dominant community in intertidal zones and form dense forests at the sea-land interface, and although the anatomical and physiological features associated with their salt-tolerant lifestyles have been well characterized, little is known about the impact of transcriptional phenotypes on their adaptation to these saline environments.</p> <h3>Methodology and Principal findings</h3><p>We report the time-course transcript profiles in the roots of a true mangrove species, <em>Ceriops tagal</em>, as revealed by a series of microarray experiments. The expression of a total of 432 transcripts changed significantly in the roots of <em>C. tagal</em> under salt shock, of which 83 had a more than 2-fold change and were further assembled into 59 unigenes. Global transcription was stable at the early stage of salt stress and then was gradually dysregulated with the increased duration of the stress. Importantly, a pair-wise comparison of predicted homologous gene pairs revealed that the transcriptional regulations of most of the differentially expressed genes were highly divergent in <em>C. tagal</em> from that in salt-sensitive species, <em>Arabidopsis thaliana</em>.</p> <h3>Conclusions/Significance</h3><p>This work suggests that transcriptional homeostasis and specific transcriptional regulation are major events in the roots of <em>C. tagal</em> when subjected to salt shock, which could contribute to the establishment of adaptation to saline environments and, thus, facilitate the salt-tolerant lifestyle of this mangrove species. Furthermore, the candidate genes underlying the adaptation were identified through comparative analyses. This study provides a foundation for dissecting the genetic basis of the adaptation of mangroves to intertidal environments.</p> </div
    corecore